Multi-scale computing over heterogeneous resources
نویسندگان
چکیده
Parallel processing of large data sets on clusters of commodity machines—of virtual or physical nature—has in recent years become an active field of research as well as a thriving business. Many companies are running large data centres to support such workloads, and task-parallel frameworks have enabled programmers to harness managed large-scale parallelism while writing mostly sequential code. At the same time, the increase in individual per-core CPU performance has slowed down significantly and the industry has moved towards parallelism as a way of increasing CPU performance. Non-cache-coherent manycore CPUs are under active research and development, and the average commodity server now contains between four and eight logical CPU cores. On top of this, increasing awareness of data centre energy consumption as an environmental and economic challenge has led to diversification of hardware used in compute and storage clusters [1]. The systems used to run such jobs across many machines, however, essentially treat them as a homogeneous collection: they partition them into many VMs at the hypervisor level or coarse-grainedly break them up into a number of “slots” per machine. Nonetheless, heterogeneity in the data centre is a reality at multiple levels—from hardware and interconnects to task-level heterogeneity within a job [5]—and may even be introduced deliberately [2]. We believe that there has been no appropriate general treatment of resource heterogeneity in compute clusters so far. In this work, we introduce the concept of independent resource ensembles, marking a departure from the classic single global master design of taskparallel compute frameworks (§2), and propose a new approach to detecting and characterizing resource heterogeneity within and between ensembles (§3), enabling heterogeneity-aware scheduling decisions. In explicitly supporting heterogeneity-awareness and dynamic extension of a computation across resource ensembles, we introduce the idea of multi-scale computation—that is, we can exploit parallelism at many scales, from fine-grained multi-core processing with communication over an onchip network to coarse-grained clusters of virtual machines (§4).
منابع مشابه
An Effective Task Scheduling Framework for Cloud Computing using NSGA-II
Cloud computing is a model for convenient on-demand user’s access to changeable and configurable computing resources such as networks, servers, storage, applications, and services with minimal management of resources and service provider interaction. Task scheduling is regarded as a fundamental issue in cloud computing which aims at distributing the load on the different resources of a distribu...
متن کاملExploring Multi-level Parallelism for Large-Scale Spiking Neural Networks
Several biologically inspired applications have been motivated by Spiking Neural Networks (SNNs) such as the Hodgkin-Huxley (HH) and Izhikevich models, owing to their high biological accuracy. The inherent massively parallel nature of the SNN simulations makes them a good fit for heterogeneous computing resources such as the General Purpose Graphical Processing Unit (GPGPU) clusters. In this re...
متن کاملMulti-Layered Network Survivability – Models, Analysis, Architecture, Framework and Implementation: An Overview
A major attack can significantly reduce the capability to deliver services in large-scale networked information systems. In this project, we have addressed the survivability of large scale heterogeneous information systems which consist of various services provided over multiple interconnected networks with different technologies. The communications network portions of such systems are referred...
متن کاملGeneral-purpose coordinator-master-worker model for efficient large-scale simulation over heterogeneous infrastructure
In this work, we propose a general-purpose coordinator-master-worker (GP-CMW) model to enable efficient and scalable simulation. The model supports distributed, and parallel simulation over a heterogeneous computing node architecture with both multi-core CPUs and GPUs. The model aims at maximizing the hardware activity rate while reducing the overall management overhead. The proposed model incl...
متن کاملA MapReduce Framework for Heterogeneous Computing Architectures
Nowadays, an increasing number of computational systems are equipped with heterogeneous compute resources, i.e., following different architecture. This applies to the level of a single chip, a single node and even supercomputers and large-scale clusters. With its impressive price-to-performance ratio as well as power efficiency compared to traditional multicore processors, graphics processing u...
متن کاملA Cloud Computing Infrastructure on Heterogeneous Computing Resources
Cloud computing is a state-of-the-art distributed computing paradigm which can support on-demand service sharing with flexibility and scalability. Cloud computing provides sharable heterogeneous computing resources using internet and data storage on a third party server. In order to use the heterogeneous computing resources in a much more efficient, scalable and flexible way, a Cloud computing ...
متن کامل